Using Markov decision processes to optimise a non-linear functional of the final distribution, with manufacturing applications

Author

  • E. J. Collins
Abstract

We consider manufacturing problems which can be modelled as finite horizon Markov decision processes for which the effective reward function is either a strictly concave or strictly convex functional of the distribution of the final state. Reward structures such as these often arise when penalty factors are incorporated into the usual expected reward objective function. For convex problems there is a Markov deterministic policy which is optimal, but for concave problems we usually have to consider the larger class of Markov randomised policies. In the natural formulation these problems cannot be solved directly by dynamic programming. We outline alternative iterative schemes for solution and show how they can be applied in a specific manufacturing example.
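The contrast between the convex and concave cases can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration and is not taken from the paper: the two-state, two-action toy process, the horizon of 2, the two reward functionals, and the brute-force search over a grid of action probabilities (which stands in for, and is not, the iterative schemes the abstract mentions).

```python
from itertools import product

# Hypothetical toy MDP (not from the paper): 2 states, 2 actions, horizon 2.
# P[a][s][u] = probability of moving from state s to state u under action a.
P = [
    [[1.0, 0.0], [1.0, 0.0]],  # action 0: always move to state 0
    [[0.0, 1.0], [0.0, 1.0]],  # action 1: always move to state 1
]
HORIZON = 2
START = [1.0, 0.0]  # the process starts in state 0 with certainty


def final_distribution(policy):
    """Distribution of the final state under a Markov randomised policy.

    policy[t][s] is the probability of choosing action 1 at time t in state s;
    values of 0.0 or 1.0 give a Markov deterministic policy.
    """
    dist = START[:]
    for t in range(HORIZON):
        nxt = [0.0, 0.0]
        for s in range(2):
            q = policy[t][s]
            for u in range(2):
                nxt[u] += dist[s] * ((1 - q) * P[0][s][u] + q * P[1][s][u])
        dist = nxt
    return dist


def concave_reward(d):
    # Assumed concave functional: penalise deviation from an even split.
    return -sum((x - 0.5) ** 2 for x in d)


def convex_reward(d):
    # Assumed convex functional: rewards concentrating mass on one state.
    return sum(x ** 2 for x in d)


def best_on_grid(reward, levels):
    """Brute-force search over policies with action probabilities in `levels`."""
    best_value, best_policy = None, None
    for probs in product(levels, repeat=HORIZON * 2):
        policy = [list(probs[2 * t: 2 * t + 2]) for t in range(HORIZON)]
        value = reward(final_distribution(policy))
        if best_value is None or value > best_value:
            best_value, best_policy = value, policy
    return best_value, best_policy


DETERMINISTIC = [0.0, 1.0]
RANDOMISED = [0.0, 0.5, 1.0]

if __name__ == "__main__":
    for name, reward in [("concave", concave_reward), ("convex", convex_reward)]:
        det_value, _ = best_on_grid(reward, DETERMINISTIC)
        rand_value, _ = best_on_grid(reward, RANDOMISED)
        print(name, "deterministic:", det_value, "randomised:", rand_value)
```

In this toy setting, allowing randomisation improves the best value found for the concave functional but not for the convex one, which matches the qualitative statement in the abstract. Exhaustive search of this kind is only feasible for very small problems.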

Similar resources

Application of Markov Processes to the Machine Delays Analysis

Production and non-productive equipment and personnel delays are a critical element of any production system. The frequency and length of delays impact heavily on the production and economic efficiency of these systems. Machining processes in wood industry are particularly vulnerable to productive and non-productive delays. Whereas, traditional manufacturing industries usually operate on homoge...

Probabilistic Power Distribution Planning Using Multi-Objective Harmony Search Algorithm

In this paper, power distribution planning (PDP) considering distributed generators (DGs) is investigated as a dynamic multi-objective optimization problem. Moreover, Monte Carlo simulation (MCS) is applied to handle the uncertainty in electricity price and load demand. In the proposed model, investment and operation costs, losses and purchased power from the main grid are incorporated in the f...

Estimating Stock Price in Energy Market Including Oil, Gas, and Coal: The Comparison of Linear and Non-Linear Two-State Markov Regime Switching Models

A common method to study the dynamic behavior of macroeconomic variables is using linear time series models; however, they are unable to explain nonlinear behavior of the series. Given the dependency between stock market and derivatives, the behavior of the underlying asset price can be modeled using Markov switching process properties and the economic regime significance. In this paper, a two-...

Design of supply chain in fuzzy environment

Nowadays, customer expectations are increasing and organizations are prone to operate in an uncertain environment. Under this uncertain environment, the ultimate success of the firm depends on its ability to integrate business processes among supply chain partners. Supply chain management emphasizes cross-functional links to improve the competitive strategy of organizations. Now, companies are ...

A new approach for Robot selection in manufacturing using the ellipsoid algorithm

The choice of suitable robots in manufacturing, to improve product quality and to increase productivity, is a complicated decision due to the increase in robot manufacturers and configurations. In this article, a novel approach is proposed to choose among alternatives, differently assessed by decision makers on different criteria, to make the final evaluation for decision-making. The approach i...

Publication date: 2001